APPROXIMATE GROUPS III: THE UNITARY CASE 
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Abstract. By adapting the classical proof of Jordan's theorem on finite 
subgroups of linear groups, we show that every approximate subgroup 
of the unitary group Un(C) is almost abelian. 
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1. Introduction 

This paper is the third in a series concerning approximate groups, the first 
two papers in the series being [6, 7J. Let us begin by repeating the definition 
of "K-approximate group" due to T. Tao (see [30]). 

Definition 1.1 (Approximate groups). Let G be some group and let K ^ 1. 
A finite subset A <^ G is called a K -approximate groupij if 

(i) It is symmetric, i.e. if a £ A then a~^^ € A, and the identity lies in 
A; 

(ii) There is a symmetric subset A C G with \X\ ^ K such that C 
XA. 

Here, as usual, denotes the product set {0102! 01,02 G A} and XA 
denotes {xo|x € X,a £ A}. One of the main reasons for introducing ap- 
proximate groups was to understand finite subsets ^ in a group G satisfying 
a doubling or tripling condition, that is |^^| or |^^| is not much larger that 
1^1 . To a large extent, the classification of sets of small doubling reduces to 

^We make here a slight abuse of terminology, because our definition is not intrinsic and 
makes use of the ambient group G, so stricto sensu we define here approximate subgroups 
of G. 

1 



2 



EMMANUEL BREUILLARD AND BEN GREEN 



the classification of approximate groups. For the relation between the two 
concepts, we refer the reader to Tao's original paper [30]. We have chosen to 
work with approximate groups here, rather than directly with sets of small 
tripling (say), so as to be compatible with our previous papers and other 
work of the authors and Tao. Approximate groups also have one or two 
advantages over sets with small tripling - for example, they behave rather 
better under homomorphisms. 

Working with approximate groups, it is convenient to introduce the fol- 
lowing notion, defined by Tao in [31] . 

Definition 1.2 (Control). Suppose that A and B are two subsets of a group 
G, and that K ^ 1 is a parameter. We say that A is K-controlled by B, 
or that B K-controls A, if \B\ ^ K\A\ and there is some set X C G with 
\X\ ^ K and such that A C XB n BX. 

Let n ^ 1 be an integer, and write Un(C) for the group of unitary matri- 
ces. The main result of this note is the following. 

Theorem 1.3. Suppose that A C Un(C) is a K -approximate group and that 
K ^ 2. Then A is n^'^^ K^'^ -controlled by B, a K'-" - approximate subgroup 
o/U„(C), which consists of simultaneously diagonalisable matrices. 

Here C is an absolute constant which could be specified explicitly if de- 
sired. As a corollary of Theorem 11.31 we can deduce a more precise result 
along similar lines, albeit with somewhat worse bounds. 

Corollary 1.4. Suppose that A C U„(C) is a K -approximate group, K ^ 2. 
Then there is a connected abelian subgroup S C U„(C) such that A lies 
in the normaliser N{S), and such that the image of A under the quotient 
homomorphism vr : N{S) — )• N{S)/S has cardinality at most n^"' K^"' . 

From basic results on approximate groups, derived from the fundamental 
work of Ruzsa and developed in [iTl [30], we can also describe the subsets 
A C Un{C) such that l^^l ^ K\A\. They satisfy exactly the same conclusion 
as in the above corollary. The sets A with ^ -ft^l^l, on the other hand, 
do not have such a nice structure, although it is a direct consequence of 
the above that such sets are contained in at most n^^^ K^"^ cosets of some 
connected abelian subgroup S C U„(C) (see the remark after the proof of 
Corollarv ll.4|) . 

In the case n = 2, Bourgain and Gamburd \2\ proved a much stronger local 
version of Theorem 11.31 in which they considered covering numbers N{A,5) 
for every resolution (5 > 0, instead of merely counting the number of points 
in A as we do. However, their approached was based on the sum-product 
theorem (as used for example in the work of Helfgott [IT]) and does not 
seem to extend easily to the higher rank case. See nevertheless the recent 
announcement [3]. 

From the qualitative point of view, a much more general result than The- 
orem 11.31 is contained in the work of Hrushovski [19] and in later joint work 
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of the authorsS and Tao [8l El [TO]: in particular, the rough structure of ap- 
proximate subgroups of GL,i(C) is now understood, with the bounds in [9] 
being polynomial in K for fixed n just as in Theorem II. 31 We have decided 
however that it is nonetheless worth having the present argument in the 
literature, since it is completely different to these other arguments and con- 
siderably more elementary in that nothing is required by way of algebraic 
group theory, quantitative algebraic geometry or model theory. Apropos of 
the last point, our bounds are completely explicit, whereas those in [9j are 
not on account of the use of ultrafilters there. As they stand, we believe that 

2 

the methods of [9] would give a bound of the form 0„(i^'^" ) in Theorem 
11.31 with the 0„(1) being ineffective. In principl^ all uses of ultrafilters in 
our papers with Tao could be replaced by effective algebraic geometry argu- 
ments, thereby giving an explicit dependence on n; however it is extremely 
unlikely that in so doing one would beat the exponential dependence in n 
that we have attained in Theorem II. 31 Moreover, the power of K appearing 
in Theorem 11.31 is merely linear in n rather than exponential. We do not 
make any claim that this dependence is sharp - indeed we believe it possible 
that a bound of the form On{K^) is the truth in Theorem 11.31 where C is 
independent of n. The implied constant certainly cannot be independent of 
n and most grow faster than exponentially even in the case K = 1. This 
can be seen by taking A = Sym(n), the symmetric group on n letters. 

The proof in this paper can be viewed as an approximate version of the 
standard proof of Jordan's theorem on finite subgroups of U„(C), which 
states that such subgroups G have an abelian subgroup H with [G : H] = 

On{l). 

In addition to these remarks we note that Theorem 11.31 can be used as a 
substitute for the so-called Solovay-Kitaev argument (see the appendix to 
|24j ) which features in the variant of Kleiner's proof of Gromov's theorem 
on groups of polynomial growth due to Shalom and Tao [27]. In fact the 
arguments of our paper offer a new elementary proof of the fact, traditionally 
derived from the Tits alternative, that finitely generated subgroups of U,i(C) 
with polynomial growth are virtually abelian. While the Tits alternative 
implies the exponential growth of non- virtually abelian subgroup of U„(C), 
our arguments fall short of this. They do however give a super-polynomial 
lower bound on the size of a word ball or radius r of the type exp(r") for 
some a = a{n) > 0. We will remark further on this connection in ^ 

Notation. The letters c, C stand for absolute constants; different instances 
of the notation may refer to different constants. All constants in this paper 
could be specified explicitly if desired. 

^Added during revision: Pyber and Szabo have recently extended their approach in [25| . 
which appeared simuhaneously with [S], to cover fields of characteristic zero as well as 
finite characteristic as in their original work. 

■^In the most recent version of their preprint |25) . Pyber and Szabo record effective versions 
of all the algebro-geometric arguments used in their work. It is not immediately clear to 
us exactly what explicit bound this could possibly give in our main theorem. 
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2. On Jordan's Theorem 

In a sense, the main idea of our paper is to take a proof of Jordan's 
theorem on finite subgroups of U„(C) and then modify it so that it works 
in the context of approximate groups too (note that a subgroup is precisely 
the same thing as a 1-approximate group). 

Theorem 2.1 (Jordan [20]). Suppose that A is a finite subgroup o/U„(C). 
Then there is an abelian subgroup A' <^ A with [A : A'] ^ F{n). We can 
tak^ F{n) = n*^"^ for some absolute constant C . 

Jordan's original proof was a very ingenious variation on the theme of 
the celebrated classification of Plato's solids, and was mainly algebraic. The 
proof we give here however is mainly geometric. It is a slight variant, which 
we learned from the weblog of T. Tao [32], on the classical proof of Jordan's 
theorem given for instance in |12j . itself based on arguments of Bieberbach 
and Frobenius (see [UllH]). The argument relies on the basic fact that the 
commutator of two elements close to the identity in a Lie group is itself much 
closer to the identity. This idea has been used repeatedly ever since (it is 
nowadays also sometimes referred to as the Zassenhaus-Kazhdan-Margulis 
trick, see [29, chap. 8]) and is also the main tool in the Solovay-Kitaev 
algorithm [24] mentioned above. 

We remark that Jordan's theorem actually applies to finite subgroups of 
GL„(C), but the first step of the proof is to apply Weyl's unitary trick to 
reduce to the unitary case. No analogue of this trick appears to be possible 
in the context of approximate groups. 

Suppose then that j4 is a finite subgroup of U„(C). The key observation 
is the following very well-known fact. The reader may also find a nice 
explanation of cognate ideas in the proof of [lf>\ Lemma 4.7]. 

Lemma 2.2 (Element with large centraliser). Suppose that A C U„(C) is 
a finite group. Then at least one of the following holds: 

(i) There is a subgroup A' ^ A consisting of scalar multiples of the 
identity with [A : ^4'] ^ n*^"^ ; 



That is to say, our proof gives a bound of this form. Completely optimal values of F{n) 
axe known by more sophisticated arguments, the latest of which, due to M. Collins [11) . 
give the sharp bound F(n) ^ (n+ 1)! for n large enough and make use of the classification 
of finite simple groups. The slightly strange use of the letter A in this statement is so that 
it may easily be compared with results in later sections. 
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(ii) there is an element & A, not a scalar multiple of the identity, 
whose centraliser Ca(7) := {x E A : xy = ^yx} has cardinality at 
least n-^"Vl- 

Proof. Equip Mat„(C) with the Hilbert-Schmidt norm: take some or- 
thonormal basis ei, . . . , e„ for C" and define ||M|| := ■ |mjj p)-*^/^, where 
the niij are the matrix entries of M with respect to this basis. It is well- 
known that this is an algebra norm, that is to say ||MiM2|| ^ ||Mi|| ||M2||, 
and we shall use this fact several times. Every unitary matrix has norm ^/r^, 
and the Hilbert-Schmidt norm is invariant under left and right multiplica- 
tion by unitary matrices. Let d be the distance induced by this norm, that 
is to say d{x,y) := \\x — y\\. 

We claim that A', the subset of A consisting of all elements with distance 
at most l/4:^/n from the identity, has cardinality at least n"*^" \A\. To see 
this, observe that a simple volume-packing argument implies that Un{C) 
may be covered by n^"^ balls of the form {g G Un{C) : d{g,go) ^ l/4^/n}. 
At least one of these balls contains at least \A\ elements of A. However 

for all of these elements g we have 

\\99o^ - In\\ = Wg - 9o\\ ^ l/4:^/n. 

This is establishes the claim. We now distinguish two cases. 

Case 1. Every element of A' is a scalar multiple of the identity. Then we 
clearly have alternative (i) in the statement of the lemma. 

Case 2. At least one element of A' is not a multiple of the identity. Let 
7 G be that amongst the elements of A' which are not scalar multiples 
of the identity for which (i(7, /„) is minimal. Then \i x ^ A' is arbitrary we 
have 

d{[l,x\,In) = d(7X7"^a;"\ In) 

= 11(7 - /„)(x - /„) - (x - 7„)(7 - /n)|| 

^ 2||7 - /„||||x - In\\ 

^d(7,/„)/2. 

Since A is a group, the commutator [7, x] is an clement of A. If it is a scalar 
multiple of the identity then, since det[7, x] = 1, we must have [y,x] = 
e^nir/nj^ for some r G N. 
Note that if r 7^ we have 

d^e'^^ir/nj^^ 7„) = le^^^^^Z" - l\n^/'^ ^ \ sin(7r/n)|nV2 ^ 2/^^. (2.1) 

Since ^([7, x],/„) ^ 1/4-^/n this implies that [7,x] = /„. If [7, x] is not a 
scalar multiple of the identity then, by the asserted minimality of 1^(7, /„), 
we are also forced to conclude that [7, x] = In- In either case we have 
established that x commutes with 7, and hence the whole of A' lies in the 
centraliser Ca(7). This is option (ii) in the statement of the lemma. □ 

Let us recall now the following standard fact. 
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Lemma 2.3 (Centralizers). Let 7 G U„(C) is not a multiple of the identity. 
Then the centraliser Cu„ (c)(7) is isomorphic to a direct product Uni(C) x 
• • • X U„j.(C), where ni + ■ ■ ■ + = n and ni < n for all i. 

Proof. The matrix 7, being unitary, is diagonalisable. Its centraliser in 
GLjj(C) may therefore be identified with GL„^(C) x • • • x GL„^(C), where 
rei + • • • + = n and < n; the integers ni are of course the multiplicities 
of the eigenvalues of 7. It is clear that the intersection of such a block 
subgroup with U„(C) is precisely U„^(C) x • • • x U„^(C), and this completes 
the proof. □ 

We may now complete a proof of Jordan's theorem, proceeding by in- 
duction on the rank n. Supposing that ^ is a finite subgroup of U„(C), we 
apply Lemma [2. 21 If option (i) holds then we are done; otherwise, option (ii) 
holds and we have a subgroup Z (the centraliser in A of some 7, not a scalar 
multiple of the identity) of size at least n"*^" |^| and which is isomorphic 
to a subgroup of U,i^(C) x • • • x U„j.(C) where k ^ n and rii < n for all i. 
Writing vTj : Z — >• U^. (C) for projection onto the ith factor, it follows from 
the induction hypothesis that there is an abelian subgroup Zj C 7rj(Z) with 
[TTi{Z) : Zi] ^ F{ni). The subgroup B = riiTT^^{Zi) C Z satisfies 

\Z\ n-^'^' 
' '^F(ni)...F(nfc) ^ F(ni) . . . F(nfc) ' 
and is abelian. So Jordan's theorem follows provided that 

F{n) ^ F{ni) . . . F{nk)n-^''\ 

That a function of the form F{n) = n*^"^ satisfies this inequality is an 
immediate consequence of the following elementary lemma. □ 

Lemma 2.4. Suppose that n ^ 2 and that ni, . . . ,nfc are positive integers 
with Ui < n for all i and ni + • • • + = n. Then 

> nl-\ h n| + n^. 

Proof. It is immediate by convexity or direct verification that {x — 1)^ + 
(y + 1)'^ > + whenever x, y are positive integers. Thus the maximum 
value of nf + •••-!- n| subject to the constraint ni + • • • + = n occurs 
when k = 2 and ni = n — 1, 722 = 1. The result then follows immediately 
from the inequality 

= (n - 1)^ + 1 + n^(3 - -) > (n - 1)^ + 1 + n^. □ 

n 

3. Approximate subgroups of the unitary group 

We turn now to the proof of Theorem 11.31 We do this by modelling the 
proof of Jordan's theorem given in ^ starting with Lemma 12.21 the lemma 
which located an element with large centraliser. We saw in Lemma 12.21 that 
multiples of the identity were slightly troublesome. To ease these issues we 
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work for now with the special unitary group SU„(C) := {g £ U„(C) : det g = 
!}• 

Lemma 3.1 (Element with large centraliser) . Suppose that A C SU„(C) is a 
K -approximate group with \A\ > n. Then there is an element j £ A"^ which 
is not a multiple of the identity and commutes with at least n~^^ K~^\A\ 
elements of . 

Proof. Since we are working in SUn(C), the only multiples of the identity 
are e^'^*^/"'/„ with r G N. Since \A\ > n, there certainly is some 7 G 
which is not a multiple of the identity. Since 7 commutes with /„, which 
is an element of A"^, the lemma is trivial whenever |^| ^ n*^" . Assume 
henceforth that 

\A\ > rf'^\ (3.1) 

This is a variant of an argument pioneered by Solymosi [28] in the context of 
sum-product estimates for C. For each a £ A select an element a* £ A \ {a} 
which is nearest, or joint-nearest, to a in the sense that d(a, a*) ^ d{a,a') 
for all a' £ A (where d is, as in the previous section, the distance induced 
by the Hilbert-Schmidt norm). Write := d{a,a*). Consider the map 
Tp : Ax Ax A^ X A^ X A"^ X defined by 

^/^(a, 01,02) := (oio, Oio*, 002, 0*02). 

It is certainly the case that aia is "near" aia* , and that 002 is "near" 0*02. 
If it was in fact the case that aia* was the nearest point in A"^ to oio, and 
0*02 the nearest point in to 002, we would clearly have | im?/^| ^ jyl'^p. 
Since \A^\ ^ it would follow that some fibre of ip has size at least 

l^l/ET. But if ip{a,ai,a2) = tp{b,bi,b2) then we have, of course, oio = bib, 
aia* = bib*, 002 = 662 and 0*02 = 6*62. Writing 7 := a~^a* = b~^b* we 
would have 

070-1 = 0*0-1 ^ (o*02)(oo2)-i = {b*b2){bb2)-^ = b*b-^ = bjb-^ 

and hence b~^a G Cg(7). As a consequence, \Cg{'j) n A"^] ^ 

To turn this into a proof of the lemma we must resolve two issues. First, 
we need to ensure that 7 is not a multiple of the identity. Secondly and more 
seriously it will not, in general, be the case that oio* is the nearest point in 
A'^ to oio. Regarding this second point it turns out that something a little 
weaker is true: for many triples (0,01,02) there are not many points of A^ 
closer to oio than oio*, and not many points of A^ closer to 002 than 0*02- 
In what follows, write -B„ for the weak Besicovitch constant of Mat„(C); 
see Appendix |A] for a full discussion, and a proof that -B„ ^ C" . We will 
examine well-behaved triples (o, oi, 02) for which oio is "almost" the nearest 
neighbour of oio* in A^ in the sense that 

Ua,a^ ■■= \{u G A^ : d(oio, u) ^ r4K WB^K, (3.2) 

for which 002 is "almost" the nearest neighbour of 0*02 in the sense that 

Va,a2 ■■= \{v e A^ : d{aa2,v) ^ r4| ^ WBnK, (3.3) 
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and for which 

a~^a* is not a multiple of the identity. (3-4) 

It is not obvious that there are any well-behaved triples, but we claim that 
this good behaviour is quite generic in the sense that there are at least | 
well-behaved triples. 

Let us first count the triples (0,01,02) £ A x A x A for which (13. 4p is 
violated. Since we are working in SU„(C), the only multiples of the identity 
are e^'^*''/"/^ with r G N, and so by the same computation we used in (|2.ip we 
get d{a~^a*,In) ^ Ij^fn and so = (i(o,o*) ^ 2/y/n. By a simple volume- 
packing argument the number of a with this property is at most n , and 
so (13. 4j) is violated for at most n^^ < |Ap/10 triples (0,01,02), this 
last inequality being a consequence of (|3.ip . 

Turning now to the examination of (13. 2p . fix oi. Then the open balls 
Br^{aai), a £ A, have the property that no centre ooi of one of these balls 
lies inside any other ball Br^,{a'ai). Indeed, if this were the case then we 
would have d{a,a') < r^', contrary to the assumption that a'* is the closest 
point of A to a' . It follows from the definition of the weak Besicovitch 
constant Bn that no point u S Mat„(C) can lie in more than Bn of these 
balls. It follows that 

Y.Ua,a, Bn\A^\ ^ BnK\A\. 
a 

An essentially identical argument using (j3.3p implies that 

^K,a2 <.Bn\A^\^BnK\A\. 
a 

The number of pairs (o, oi) for which Ua,ai ^ WBnK is thus at most |Ap/10, 
as is the number of pairs (0,02) for which Va,a2 ^ lOBnK. It follows from 
this that there are at least |^P/2 well-behaved triples, as claimed. 
Let us now consider the map defined above, 

■(/'(o, 01,02) = (oio, oio*, 002, 0*02), 

restricted to this set S of at least |j4|^/2 well-behaved triples. We claim that 
im{ip\s) is reasonably small; this implies that ip has a large fibre, and we 
may then conclude as in the simplified sketch above. 

Suppose, then, that {x,y,z,w) S im{'tp\s)- There are at most choices 
for X, and the same for z. Once these have been specified, consider the 
possible choices for y. Single out one of these, y, corresponding to the 
well-behaved triple (0,01,02) with d{x,y) = d{a,a*) maximal. Then for all 
permissible y we have 

d{aia,y) = d{x,y) ^ d{x,y) = d{aia,aia*) = d{a,a*) = Va- 

Since (0,01,02) is a well-behaved triple, it follows from (j3.2p that there are 
at most IQBnK choices for y. Similarly, there are at most IQB^K choices 
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for w. It follows that 

|im(V'|s)| ^ {WBnKf\A^f, 

2 fill 

and so tjj has a fibre of size at least C"" i^~"|yl|. By precisely the same 
argument used in the informal discussion at the start of the proof, this 
implies the result. □ 



Added in revision. Upon seeing our paper, and in particular noting our 
idea of mimicing the proof of Jordan's theorem in the approximate group 
setting, the referee came up with an elegant and simpler argument for prov- 
ing (a very slight variant of) this pivotal lemma which he or she was generous 
enough to share with us. We sketch this now. First of all look at A' , the 
elements of A"^ at distance at most 1/4-^/n from the identity. As remarked 
above, none of these are multiples of the identity. By a simple volume- 
packing argument, \A'\ ^ \A\. Let p be the minimum value of ^(7, In) 
over all 7 G A'. Suppose that there are L elements 7' G A'^ with 

d(7',I„)<i/j. (3.5) 

Then, multiplying by the elements of A' and using the minimality of p, we 
obtain the inequality \A'^\ ^ -^^|^'|- Since A is a i^- approximate group we 
have \A^\ ^ K^\A\, and therefore L ^ n'^'^^K^. 

However, by the inequalities noted in Case 2 of the proof of Lemma 12.21 
any commutator 7' = [7,2;], x G A', will satisfy (13. 5p . It follows that 
there are merely n*^" different values taken by this commutator, and 
hence there is some further set A" C A\ \A"\ ^ n"^" K-^\A\, such that 
[7, x] = [7, y] whenever x, y G A". A very short computation confirms that 
x~^y centralises 7 for any such pair x,y, and this concludes the proof. 

Remark. We note that this argument of the referee uses slightly less than 
our original one, in that only the bounded doubling of balls in the Hilbert- 
Schmidt norm is required, as opposed to the rather more subtle Besicovitch 
property. 

A consequence of Lemma 13. 1^ proven below, is the following. 

Corollary 3.2. Suppose that A C U„(C) is a K -approximate group. Then 
either there is a coset xZ of the centre Z = Ui(C) C U„(C) such that 
\Ar\xZ\ ^ n~-^|^|, or there is an element ^ £ which is not a multiple of 
the identity and commutes with at least n"*^" elements of A? . 

In the proof of this corollary and elsewhere we require two lemmas con- 
cerning the behaviour of approximate groups under intersections and homo- 
morphisms. Related results appear in work of Helfgott [TSj and later papers 
such as pin EI]. 
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Lemma 3.3. Let K ^ 2 be a parameter and let A be a K -approximate 
subgroup oj'G. Let H ^ G be a subgroup. Then A^nH is a 2 -approximate 
group and {A'' n H\ ^ K^'^\A^ n IL\ for every k ^ 1. 

Proof. Let X, \X\ ^ K, be as in the definition of approximate group. 
Then, for any positive integer k, we have 

^ x'^-^A. (3.6) 

Now if g £ G and yi,y2 S gACiH then ^1/2 £ A'^ DH. It follows that 

gAnH C y{A^nH) 

for any y E gA n H (or, if gA fl H happens to be empty, for any y at all). 
Let y be a set consisting of one such value of y for each choice of g G X'^"^. 
It follows from the preceding discussion and (13. 6p that 

A^nH OY{A^nH). 

This confirms the second statement of the lemma. Taking k = 4 and noting 
that {A^ n H)"^ O A^ n H gives 

{A^nHf C Y{A^nH). 

Since A"^ Ci H is symmetric, this implies that 

{A^nHf c {A^nH)Y-^. 

This confirms that n is a 2i^^-approximate group, with covering set 
YUY-K □ 



Lemma 3.4. Suppose that A is a symmetric set in some group G, and that 
TT : G — )• G' is a homomorphism from G into some other group G' . Suppose 
that X ^G' is a set and that \ir{A)nX\ = 5\Tr{A)\. Then \ A^nTr-^{X)\ ^ 
6\A\. 

Proof. Let M be the size of the largest fibre of A above G', that is to say 
maxj; 1^ n 7r~"^(x)|. Then has a fibre of size at least M over idc, and 
thus has a fibre of size at least M over each point of vr(^). In particular, 

\A^ n TT-\x)\ ^ m\tt{A) r\X\^ M6\tt{A)\. 

On the other hand it is clear that \A\ ^ M|7r(A)|. Combining these two 
inequalities leads to the stated bound. □ 

Proof of Corollary \3.S[ Let n be the projection Un(C) — )• PU„(C) whose 
kernal ker(7r) = Z is the centre of U„(C). Let A := 7r-^{7T{A)) D SU„(C) = 
AZn SU„(C). Note that \A\ ^ |7r(A)|. If |7r(yl)| ^ n, then there is a coset 
xZ such that \A n xZ\ ^ n~"'^|74|. If not then \A\ > n and so Lemma l3.ll 
applies (with A in place of A) and we obtain an element 7 in A'^, not a 
multiple of the identity, such that 
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Pushing this forward under tt and noting that fibres of tt in SUri(C) have 
size at most n, we obtain 

|7r(A2) nvr(C7u„(c)(7))l ^ n-^^'^' K-'\^{A)\. 
It follows from Lemma 13.41 that 

l^'nCu„(c)(7)l^n-^'"'i^-Vl, 
and hence from Lemma 13.31 that 

This concludes the proof. □ 

We have established an "approximate" analogue of Lemma [2.2[ It remains 
to complete the proof of Theorem 11.3^ and we do this by proceeding in a 
manner rather analogous to that at the end of ^ that is to say by induction 
on n. 

To make this work efficiently, we prove the following statement. 

Lemma 3.5. Suppose that A is a K- approximate subgroup of some group 
G group {which, in applications, will he a unitary group). Let H ^ G be a 
subgroup isomorphic to U„(C) x Hq for some group Hq and some n ^ 2, and 
suppose that \Ar]Hx\ ^ 6\A\ for some 6 > and some coset Hx. Then there 
is a further subgroup H' , isomorphic to U„j (C) x • • • x U„j.(C) x Ui(C) x Hq 
where Ui < n for all i and ni + • • • + = n, together with an x' such that 
\AnH'x'\ ^ n-^''^5K-^\A\. 

Proof of Lemma 1 3. 5\ The hypothesis |^ni/x| ^ 5\A\ immediately implies 
that \A^ n H\ ^ 6\H\. By Lemma [33] we see that S := A"^ D H is a K^- 
approximate group. By assumption we have 

H ^ U„(C) X Hq. 

The projection 'k{S) onto the first factor U„(C) is another IT "^-approximate 
group and we may apply Corollary 13.21 to it. If we are in the first case of 
that corollary, the lemma follows immediately with H' = Z x Hq, where 
Z ^ Ui(C) is the centre of Un(C). 

If we are in the second case, then there is an element 7 in S'^ such that 
|7r(S')2 n Cu„(c)(7)| ^ n-^"'/C-^|7r(S')|. By Lemma ED this centraliser 
Cu„(c) (7) is isomorphic to a subgroup of some product (C) x • • • x U„j. (C) 
with 71-1 + • • • + nfc = n and Ui < n for all i. Write 

H' := U„, (C) X • • • X U„, (C) X Ui(C) x Hq. 

By Lemma 13.41 we have \S^ n H'\ ^ n'^'-"'^^ K~'~"\S\ and hence, by Lemma 
[3751 that 

p ^/| ^ 1^12 ^ ^/| ^ ^-Cn2^-C|^| ^ SK-^\A\. 

Since ^ is a iC-approximate group, is covered by K translates Ax of A. 
The result follows immediately. □ 
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Proof of Theorem I i . 31 Simply apply Lemma [3 . 5 1 rep eat edly. starting with 
H = U„(C). After at most n steps we end up with some x such that 
|An-fr'x| ^ n~^^ K~'-''"'\A\, where H' is isomorphic to a product of at most 
2n copies of Ui(C) and in particular is abelian. It follows that R H'\ ^ 
„-Cn3^-Cn|^|^ and hence by Lemma [Ol that B := r\ H' satisfies the 
conclusions of Theorem 11.31 □ 



4. A MORE PRECISE RESULT 

Our aim in this section is to establish Corollary II. 4^ a somewhat more 
precise structural conclusion about approximate subgroups of the unitary 
group. Let us begin by recalling the statement. 

Corollary 11.41 Suppose that A C Un(C) is a K -approximate group. Then 
there is a torus S C U,i(C) such that A lies in the normaliser N(S), and such 
that the image of A under the quotient homomorphism tt : N(S) — )• N{S)/S 
has cardinality at most n^'^ K^^ . 

Recall that by a torus we mean a connected abelian subgroup. We will 
find it convenient to introduce the notion of root torus: a root torus is by 
definition the intersection of conjugates of the full diagonal subgroup T of 
Un(C). A root torus is a priori a closed abelian subgroup of U„(C). It is in 
fact connected, as the following lemma shows. 

Lemma 4.1. Every root torus in U„(C) is connected and hence is a torus. 
Moreover it is the intersection of at most n conjugates of the full diagonal 
subgroup T. 

Proof. Let = giTg~^ be a collection of conjugates of the full diagonal 
subgroup T (say with Ti = T). Pick an element 7 G T with distinct eigen- 
values. An element of Un(C) lies in T (resp. Ti) iff it commutes with 7 
(resp. gi^gj^^). On the other hand a diagonal matrix diag(Ai, A„) com- 
mutes with a matrix (aij) if and only Aj = Xj whenever aij / 0. Prom these 
remarks it follows that the intersection f^ - Ti is the subset of T defined by 
the equality of certain eigenvalues. It is thus isomorphic to a direct product 
of at most n copies of the group of complex numbers of modulus one, and 
in particular it is connected. The second assertion of the lemma also follows 
immediately. □ 

Proof of Corollary \1.4\ By our main theorem, there is a conjugate T of the 
full diagonal subgroup of Un(C) and a iC'-^-approximate group B QT which 
n'^"^i^'^"-controls A. In particular, |^^nr| ^ 6\A\, where 5 := n"'^"^_fC"'^"'. 
Let 5 := P|^g^^^aTa~^. Clearly 5" is a root torus and A lies in N{S). 
Moreover, since g'-tori are connected and the dimension of S is at most n, 
there must exist Oj E A"', i = 1, ...,n, ai = In, such that S = fllLi o,iTa~^. 
Set Si = Plj-^j ajTa""*^. We will establish by induction that n S'jl ^ 
where 5i = {SK^'^^^^y^'^ . This statement in the case i = n + 1 implies 



APPROXIMATE UNITARY GROUPS 



13 



that lA^nSl ^ 6n+i\A\. This establishes the corollary since |7r(^)||yl2n5| ^ 
\A^\ ^ K'^\A\, and so \7r{A)\ ^ 6~l^K^ ^ 

The base of the induction i = 2 has already been checked, so we will 
focus on the induction step, assuming i ^ 2. Set Bi = A^n Si and B2 = 
ai{A'^nT)a-^ C ^2n+2_ j^^^^ \B2\ = lA"^ nT\ ^ 6\A\. On the other hand 
B1B2 ^ ^2n+4 g^j^j i^^i ^ by the induction hypothesis. It follows 

that if F is the largest fibre of the map (p : Bi x B2 ^ B1B2 defined by 
4>{bi,b2) = hh, then 

\Bi\\B2\ ^ \F\\BiB2\ ^ |F|||^2n+4| ^ ^2n+3|^| 

and therefore 

|F| ^ 66iK-^''-^\A\. 
Since F is a fibre of 0, there is x G B1B2 with at least \F\ representations 
as 6162 with bi £ Bi and 62 £ -82. Fix one of these representations and let 
X = b'^b'2 be any other. Then we clearly have 

and so b'^'^bi G i?!- Since different values of b'l give different values of b'^^bi, 
it follows that |-Bf fl ^ Note, however, that 

Bl = {A^ n Sif c n 5i, 

whilst 

Bl = ai{A^ n r)2a^"^ C mTa'^, 

whence 

n s| c ^4 n 5i n aiTa^^ = A^r\ Si+i. 

Therefore |A^n5j+i| ^ Since ^ is a K-approximate group, A'^ is covered 
by K^-translates of A. In particular there is some x such that | An S'j+ix| ^ 
K~^\F\, and this immediately implies that \A^ n5j+i| ^ K~^\F\ > 5j+i|j4|, 
the desire to have this last inequality hold being the reason for our particular 
choice of This ends the proof of the induction step and hence the proof 
of the corollary. □ 

Remarks. Suppose that A C U„(C) is a symmetric set satisfying the small 
tripling condition \.^\ ^ -^1^1- Then the conclusion of Corollary 11.41 still 
holds, since then A? is a X'^-approximate group containing A. This follows 
from standard multiplicative combinatorics (see, for example. Proposition 
3.1 in [6]). As a consequence we obtain the following corollary. 

Corollary 4.2. Suppose that A C Un{C) is a symmetric subset with \A'^\ ^ 
K\A\ and that the closure of the subgroup {A) is a connected subgroup of 
U„(C) with no connected abelian normal subgroup. Then \A\ ^ n'^" K'^^ . 

Proof. The set is a X'-^-approximate group, and so by Corollary 11.41 
it must be contained in N{S), where S* is a connected abelian subgroup of 
U„(C). By our assumption, G := (A) is a connected semisimple compact 
group with dimension ^ n^. It is well-known (e.g. see [4]) that the centre 
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of a connected semisimple compact Lie group of dimension d is finite and 
in fact of size at most d. Since S D G is a finite normal subgroup of G, it 
is central (this follows by connectedness of G, since the map G ^ G,g ^ 
gxg~^ is continuous and takes only finitely many values if x belongs to a 
finite normal subgroup) and thus of size at most n?. By Corollary II. 4| 
{A^ n SI ^ n^^"- K^^"^ \A\, and so the result follows immediately. □ 

If A is only assumed to have small doubling, i.e. \A^\ ^ -f^l^l; then 
it follows from the non-commutative Balog-Szemeredi-Gowers lemma (see 
|30j ) that A is i^'-'-controlled by a X*-^-approximate subgroup. In particular, 
applying Theorem II. 3^ we conclude that A is contained in n*^"^' cosets 
of a connected abelian subgroup of U„(C). 

5. On Gromov's theorem 

In this section we show how our main result gives a new elementary proof 
of the fact that non-virtually abelian subgroups of U,i(C) cannot have poly- 
nomial growth, and in fact have growth at least exp(r"). 

Recall that a group G has polynomial growth with exponent d if there is 
a finite symmetric set S of generators such that one has the bound 

^ Br'^ (5.1) 

for all r ^ 1, where B = B-^ does not depend on r. If one set S of generators 
has this property then it is easy to see that any other set S' does too, 
although By.1 may be different. Thus polynomial growth is a well-defined 
property of the group. 

Proposition 5.1. Suppose that G C U„(C) is a finitely generated group 
with polynomial growth. Then G is virtually abelian. 

Proof. Let 5" be a generating set. There are clearly arbitrarily large r for 
which 

is^n^8^i5^n, (5-2) 

since if not the polynomial growth hypothesis would be violated. Call these 
values good, and suppose in what follows that r is good. By standard 
multiplicative combinatorics (see in particular Proposition 3.1 (v) in Part I 
of this series) it follows that A := S^'' is a ii'-approximate group for some 
K = 0{iy. By Theorem Ol there is some abelian group H ^ U„(C) and 
a coset Hx such that \A n Hx\ ^ c„_d|^|) where c„^d > depends only on n 
and d. We therefore have 

nH\ = \A^ nH\;^ CnM\ > c„,d|sn. (5.3) 

Replacing H by the subgroup generated hy A"^ Ci H (if necessary) we may 
assume without loss of generality that H ^ G. Assume that [G : H] =00. 
Then, since S generates G, it is easy to see that S'^ meets at least k different 
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right cosets of H, for every integer k ^ 1. It follows from this observation 
and dSSI) that 

Choosing k > and some good value of r with r > k, we obtain a 

contradiction to (j5.2p . Thus we were wrong to assume that [G : H] = oo, 
and this concludes the proof. □ 

One could run the above argument more carefully to get an explicit upper 
bound on [G : H]. However this observation is redundant here since it is 
known by rather easier arguments that any virtually abelian group G ^ 
U„(C) has an abelian subgroup H with [G : /f] ^ F{n), where F{n) = 
0{n\[n + 1)!). We offer a brief sketch proof of this fact in Appendix iBl 

Using Corollarv 14.21 oris can also prove the following quantitative form of 
the above proposition. 

Proposition 5.2. Let S he a finite symmetric subset ofVniC) and that (S) 
is not virtually abelian. Then ^ 2'^^" for all r ^ 1, where a > depends 
only on n and c = cs > 0. 

Proof. Let G be the closure of the subgroup (E) generated by S, let G^ 
its connected component of the identity, and i := [G : G^]. There is no 
loss of generality in passing to subgroup (S) n G^. Indeed contains a 

generating set for (S) n G^ (see e.g. [9l Lemma C.l]) and we may replace S 
by this subset. As a result, we may assume that G is connected. Let Z be its 
centre and write vr : G — )• G/Z for the quotient. Then, for every r ^ 1, vr(S^') 
generates a dense subgroup of the non-trivial connected centre-free compact 
Lie group G/Z. The contrapositive of Corollary 14.21 therefore applies and 
we obtain an e = e(n) > for which |7r($]^'^)| ^ |7r(S'^)|^"'"^ for every k ^ 1. 
Iterating this clearly leads to a bound of the form |7r(S'')| ^ 2^^ , which 
certainly implies the proposition. □ 

Of course, much stronger results in this context are known. In fact from 
the Tits alternative [33] , the theorem of Milnor [22] and Wolf [35] , and the 
fact that every nilpotent subgroup of U,i(C) is virtually abelian it follows 
that a finitely-generated subgroup G ^ Un(C) which is not virtually abelian 
has exponential growth. Moreover, in view of the uniform Tits alternative 
[5], the exponential growth rate is even independent of S. 

We remark that non polynomial growth for certain subgroups of GL„(C) 
was used as a key ingredient by Gromov in his original work [16j and also, 
subsequently, by Kleiner [21], who needed this fact only for subgroups of 
U„(C). Our arguments here may be inserted into Kleiner's work, thereby 
avoiding any appeal to the Tits alternative. It should be noted that Shalom 
and Tao ^27j also avoid the Tits alternative in the relevant step of their 
variant of Kleiner's proof, appealing instead to the Solovay-Kitaev argument. 
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Appendix A. Simple facts from metric geometry 

We need some facts concerning covering by balls in certain metric spaces. 
If (X, d) is a metric space then we write B{x, r) = {y £ X : d{x, y) < r} for 
the open ball of radius r centred on x and B(x, r) := {y €z X : d{x, y) ^ r} 
for the corresponding closed ball. 

Definition A.l. Let {X,d) be a metric space. We say that X has the weak 
Besicovitch property with constant k if the following is true. If xi, . . . ,Xk S 
X and i/ri,...,rfc E M^o such that the closed balls B{xi,ri) have 
nonempty intersection then there are distinct indices i and j such that Xi 
lies in the open ball B{xj,rj) 

We call this the weak Besicovitch property since it follows easily from the 
usual Besicovitch covering property as detailed, for example, in Theorem 
1.1 of [14J. It seems to be somewhat weaker and easier to prove than that 
property, however. We shall write /cbes(^) for the smallest constant k which 
works in the above definition. 

Example. We have A;bes(I^^) = 8) where is endowed with the Euclidean 
metric and identified with the complex plane. To see that fcbes(^^) ^ 8, 
suppose that xi, . . . , G E? and that ri, . . . , rg G E^o- Let z lie in the 
intersection of all eight of the closed balls B{xi,ri). Perhaps one of the Xi 
coincides with z; if so, suppose it is xg. By the pigeonhole principle there 
is some choice of 1 ^ i < j ^ 7, such that the angle ZxiZXj is less than 
7r/3; this means that \xi — Xj\ is less than either rj ^ |xj — 2:| or ^ |xj — z|, 
and hence that either Xj G B{xj,rj) or Xj G B{xi,ri). On the other hand it 
is clear by considering xj = e^'^*-'/®, j = 1, 2, . . . , 6, X7 = and rj = 1 that 
A;bes(I^^) is not less than 8. 

It is not particularly difficult to adapt the preceding argument to establish 
the following. 

Lemma A. 2. Suppose that M" is endowed with the Euclidean metric. Then 
A:bcs(M") ^ 3" + l. 

Proof. By the argument just outlined for = C, it suffices to show that if 
3" distinct points yi, . . . , y^ are taken on the unit sphere in M"' then there 
are distinct indices i,j such that the angle ZyiOyj is less than vr/S. But if 
there is no such pair of indices then the open spherical caps centred on yi and 
with radius tt/6 are disjoint. We conclude by a volume-packing argument, 
considering the balls of radius ^ centred on the points yi together with the 
one centred at the origin. There are at least 3*^ + 1 of these balls, which are 
disjoint, have radius ^, and are all contained in the ball of radius | about 
the origin. This is impossible since (3" + l)/2" > (3/2)". □ 

This has the following simple corollary, which we used in the proof of our 
main theorem. 
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Corollary A. 3. Let {X, d) be the metric space consisting of the matrices 
X = Matn(C) together with the distance induced from the Hilbert- Schmidt 
norm. Then the weak Besicovitch constant k\^c<i{X) is bounded by 3^" +!.□ 

Appendix B. Virtually abelian subgroups of U„(C). 

Our aim in this appendix is to outline a proof of the following statement. 

Proposition B.l. Suppose that G ^ U„(C) be a virtually abelian group. 
Then there is a normal abelian subgroup H ^ G with [G : H] ^ 0(n!(n+l)!). 

Remark. The rather strong bound we obtain relies heavily on Collins' s 
bound for Jordan's theorem [TT], which in turn depends on the Classification 
of Finite Simple Groups. Inputting softer proofs of Jordan's theorem (such 
as the one we gave in ^ of this paper) would give a vastly more elementary 
argument, but would lead to correspondingly cruder bounds of the form 
exp(Cn'^). 

Proof. Passing to the Zariski closure, we may assume without loss of 
generality that G is an algebraic subgroup. Its connected component of the 
identity is a torus S C U„(C). The centraliser Z{S) of this torus is a direct 
product of unitary groups Um(C) which are permuted by the normaliser 
N{S). In particular, [G : Z{S)] < [N{S) : Z{S)] ^ n\. According to a 
lemma of Platonov, for any algebraic subgroup H ^ GL„(C) there exists a 
finite subgroup F such that H = FH^, where is the Zariski connected 
component of the identity (see [Ml 10.10]). Applying this to H = GCi Z{S), 
we get a finite subgroup F C Z{S) such that G n Z{S) = FS. By Jordan's 
theorem and Collins's bound [11], there is a normal abelian subgroup Fq <^ F 
of index 0{{n + 1)!). Now FqS is abelian and normal in G and of index 
0(n!(n + l)!). □ 

We conclude by remarking that simple examples show that no analogue of 
Proposition IB. II holds in GL„(C). Indeed the group G ^ GL2(C) consisting 
of all upper triangular matrices whose diagonal entries are 1 and an mth 
root of unity is virtually abelian yet has no abelian subgroup of index less 
than m. 
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